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Abstract 

OV 

0^ . A derivation step in a Graph Interpolation Grammar has the effect of scanning an input 

token. This feature, which aims at emulating the incrementality of the natural parser, restricts 
the formal power of GIGs. This contrasts with the fact that the derivation mechanism involves a 
context-sensitive device similar to tree adjunction in TAGs. The combined effect of input-driven 
derivation and restricted context-sensitiveness would be conceivably unfortunate if it turned out 
that Graph Interpolation Languages did not subsume Context Free Languages while being par- 
tially context-sensitive. This report sets about examining relations between CFGs and GIGs, 
and shows that GILs are a proper superclass of CFLs. It also brings out a strong equivalence 
between CFGs and GIGs for the class of CFLs. Thus, it lays the basis for meaningfully investi- 
gating the amount of context-sensitiveness supported by GIGs, but leaves this investigation for 
further research. 
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oo 1 Introduction 

1.1 Graph Interpolation Grammars as automata 



Graph Interpolation Grammars are indifferently viewed as grammars or automata. That is 
to say that a derivation step in a GIG is an automaton move during which one token of the 
input string is scanned. In this perspective, GIG rules can be viewed as the finite control of an 
automaton. 



r 
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^ ■ 1.2 Rationale for GIG graphs 

H ' 

The output of a GIG is a graph rather than a tree. If trimmed down to its bare essentials, a GIG 
graph can be considered as a particular representation of a tree, in which a node is connected 
to its leftmost child through a parent-of edge, and each child is connected to its right sibling 
through a predecessor- of edge. Now, the advantages of this representation are the following 

1. A parse subgraph is allowed to assign a smaller arity to a given node than the complete 
graph that includes it, so that arities can be determined incrementally. For example, given 
a sentence starting with John thinks Mary . . . , what can be hypothesized is that Mary is 
the subject of a verb, but the arity of this verb is unknown at this stage of the parsing. 

2. In addition to tree adjunction, which shifts a subtree further down the parse tree, parse 
graphs support a similar operation which shifts a subgraph further right, thus allowing sets 
of constructions such as the following to be viewed as paradigms. (Alternating paradigm 
constituents appear in square brackets and gaps are represented by squares). 

Who did you [invite] □? 
Who did you [happen to invite] □? 
Who did you [think I invited] □? 
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Such a paradigm is captured in GIG by considering that the basic construction (with the 
constituent invite) can be expanded by interpolating a subgraph to the left of the gap. One can 
expect it to be hard, if possible at all, to represent such a constituent as think I invited (without 
the object gap) by a tree, especially if intuitive phrase structure is to be preserved. Hence the 
need for a more versatile structure than standard parse trees. 

1.3 Expressive power of graph interpolation 



The operation on which GIGs are based, graph interpolation, is basically a tree adjunction [ JLT75 



VSW94 ] applied to a pair of GIG graphs. On the one hand, graph interpolation is more general 
than tree adjunction, but, on the other hand, its application obeys a very strong constraint, 
namely that it be performed in response to the scanning of a token. Owing to this constraint, 
it is even legitimate to ask whether Graph Interpolation Languages subsume Context-Free Lan- 
guages. So, before solving the important problem of GIGs relation to TAGs in term of formal 
power, it is worth comparing GIGs with CFGs. This comparison is the subject of the present 
report. 

1.4 Content of the report 

This report proves that 

• Graph Interpolation Languages include Context-Free Languages (Section ||), 

• Graph Interpolation Languages properly include Context-Free Languages (Section ^), 

• GIGs are strongly equivalent to CFGs on the class of CFLs (Section [|), 

• the intersection of CFLs and deterministic GILs includes the class LL(1) (Section ||). 

A preliminary to these results is the definition of a GIG, which is given in the next section. 

2 Definition of a Graph Interpolation Grammar 



A linguistic presentation of Graph Interpolation Grammars was given in [Lar98|. The definition 
given here deliberately ignores all features not directly relevant to the purpose of defining GIGs 
as automata. 

• Multiple-interpolation rules will not be considered, for the formal power they contribute 
is excessive in the perspective of a comparison with CFGs. 

• A fixed word order will be assumed. 

• Phrase heads and grammatical functions will be ignored, which shows that they can be 
decoupled from parsing issues. 

• Node label subtyping, which has no impact on formal power, will not be made use of. 

Apart from these introductory remarks, the present report can be read without any prior 
knowledge of Graph Interpolation Grammars. 

2.1 Parse graphs 

A parse graph is a representation of a tree in which the relation parent-of is broken down into a 
first-child edge and zero or more right-sibling edges. More precisely, a parse graph T is a tuple 
(N, L, r, V, E, 7, E, F), where 

• N and L are disjoint sets of nodes (respectively nonlexical nodes and lexemes), 

• r is a distinguished member of N, called the graph root, 

• V and S are disjoint sets of node labels (respectively variables and symbols), 

• 7 is the label assignment, i.e. a function in N x V U L x 'S, 
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• E is the set of first-child edges, i.e. a function in N x {N U L), 

• F is the set of right-sibling edges, i.e. a function in N x N. (This definition precludes 
lexical nodes, i.e. members of L, from having siblings.) 

2.1.1 Constraints on E and F 

• E and F are functions, i.e. \E(n)\ < 1 and \F(n)\ < 1 for all n in TV. (The notation lZ(x), 
where 1Z is a relation, denotes the set {y | (x, y) € T^}-) In a complete parse graph, N is 
the domain of E, i.e. \E(n)\ = 1 for all n in iV. 

• Every node except the root has exactly one immediate dominatoi|] by E U F, i.e. 
\{E U F)" 1 ^)! = 1 for all n in N - {r} U i. 

• Every node in i is a sing le child with no descendant, i.e. E(l) = F(l) = = for 
all / in L. 



2.2 Path interpolation 

A path interpolation consists in plugging into a parse graph T\ = (Nx, Lx, Tx, V, S, 71, Ex, Fx) a 
disjoint parse graph T2 = {N2, L2, s, V, S, 72, i?2, ^2) by substituting a path of T2 for a node <? 
of Ti that is called the anchor of the interpolation. The path that is substituted is defined by 
3 nodes of T2, s, the source of the path (which is the root of T2), t, the target of the path, and 
s', the successor of s in the interpolation pathQ. 

Owing to properties of Ei and F2 (i.e. existence of at most one immediate dominator for 
any node), there is at most one path from s to t. A constraint on path interpolation is that 
there be exactly one path (of length or more) from s to t. 

There are 3 types of interpolation. 

• In a down-shift, (s, s') is in E2. A down-shift has the effect of a tree adjunction. 

• In a right-shift, (s, s') is in F2. A right-shift has the effect of shifting a sibling edge of Ti 
further right in the resulting graph. 

• In an e-shift, s — s' — t. The effect of an e-shift is to insert a subgraph at the anchor q. 

Formally, an interpolation X is a function that maps a tuple of the form (Tx,T2,q,t) to a 
parse graph T3. (q is the anchor of the interpolation, and t the target, i.e. the endpoint, of the 
interpolation path.) Let Tx and T 2 be defined as above, then T 3 = (N 3 , L 3 ,r 3 , V, S, 73, E 3 ,F 3 ) 
is defined as follows, where s is the root of I^ (i.e. the interpolation source) and s' is the node 
that immediately follows s in the interpolation path. 

• iV 3 = Nx - {q} U N 2 

• L 3 = Lx U L 2 

• r 3 = rx 

• 73 = 7i U 72 



A node n is said to dominate a node m in a rooted graph whenever every path from the root to m goes through 
n. The term dominator is here preferred to predecessor or successor to avoid confusions with the successor relation 



introduced in Section 2.2.2. 

2 Examples of interpolations are shown on Figure [I]. 
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E% — E\ U E2 

// parent of q, if any, becomes parent of s 

- {(n,q)\ neN^ 

U {(n,s)\ (n,q)eE 1 } 
// child of q, if any, becomes child of t (down-shift) or s (right-shift) 

- {(q,n)\ nem} 

U {(m,n) I (g, n) E E± and 

(((a, a') E E 2 and m = t) 
or ((s, a') ^ E% and to = s))} 



F3 — Fi U -F2 

// Ze/f sibling of q, if any, becomes left sibling of s 

- {(n,q)\ neN-t} 

U {(n,s)\ (n,q)€F 1 } 
// right sibling of q, if any, becomes right sibling oft (right-shift) or s (down-shift) 

- {(q,n)\ n€Ni} 

U {{m,n) I (q, n) E F\ and 

(((a, a') E F 2 and m = t) 
or ((a, a') ^ F 2 and to = a))} 

2.2.1 Definition constraints for Z 

1. Disjunction constraint Ti and T2 are disjoint, i.e. iVi (~l iV2 = (from which it follows 
that the other pairs of sets are disjoint too, given constraints on parse graph edges). 

2. Lexicalization constraint T 2 is mono-lexicalized, i.e. \L 2 \ = 1. 

3. Context constraint Ti is lcftmost-lexicalized, meaning that its frontier, i.e. its yield 
over (Ni U L\)* , is in the regular language L*iVj*. (The concept of frontier is defined more 
precisely in the next section.) 

4. Frontier constraint Given xa the frontier of Ti, with x in L\ and a in N£, and Sa/3 the 
frontier of T 2 , with a in L 2 and (5 and /3 in iVj , the frontier of T3 has prefix a;aQ. 

Figure [l] illustrates the three forms of interpolation. 

2.2.2 Anchor position 

Given a triple (Ti,T 2 ,t), the constraints just enumerated restrict the set of possible positions 
for the anchor q with respect to the last lexeme of Ti if (Tx, F%, q, t) is to belong to the definition 
domain of interpolation. As a matter of fact, q is either an ancestor or an immediate successor 
of the last lexeme of T\, as this section will attempt to proveQ. 

Immediate successor The notion of "immediate successor" is defined as follows. Let < be 
the "successor" relation in a parse graph T, defined as follows. (The operator o denotes relation 
composition or, equivalently, edge concatenation.) 



< = ((F- 1 )* or 1 )* 0F0E* 



Then, the "immediate successor" relation, noted as <C, holds between to and n if and only 
if to < p < n implies that to = p or n = p. 

3 For details on the possible forms of the frontier of F3, see Section^. 

4 This result does not imply that there is a single choice of q for given T\, T2, and t. 
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(c) right-shift 

Figure 1: The three types of interpolation 
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Graph frontier This relation makes it possible to define the frontier of a parse graph 
r = (N, L, r, V, E, 7, E, F) more precisely than was done in the previous section. Let T equal 
{n G N U L E(n) = 0} in: 

fronticr(r) = {w | aicf and 

Vx, y E T* , Vm, n G T : w = xmny =>mCn} 

The frontier of a parse graph is a string containing all of and only its terminal nodes in 
■C order. (The definition of <C implies that \w\ = |T|.) 

Anchor position with respect to the last lexeme Let \i be the last lexeme of V x 
and v the single lexeme of Yi . It will now be shown that the constraint /i <C v (which results 
from the frontier constraint) can only be satisfied if q, the anchor, is an ancestor or immediate 
successor of p. This theorem does not cover the initial interpolation step, which starts from a 



context Ti that consists of a single nonterminal node (Section 2.4) 



Theorem 1 If L\ is nonempty, X is defined for (Ti,T2,q,t) only if q is either an ancestor 
or an immediate successor of the last lexeme ofT\. I.e., let p be the rightmost lexeme in the 
frontier ofT\, either q £ (F^ 1 * o E x )*(p,) or p <C q must hold. 

Proof outline The constraint p <C v can be decomposed as follows, where to and n are 
nodes of F 3 . 

p -TT^-i TO n ^ v 

^ F 3 oE 3 - 1 F 3 E 3 

In order to locate q with respect to \i, we will locate s, the root of I^, and then use the fact 
that s occupies in T3 the position that q occupies in Y\. More precisely, the unique E\ Uiq-path 
from r\ to q contains the same nodes (down to q itself) as the unique E% U -F^-path from r% to s. 

Now, as the root of T2, s dominates v, which implies that it lies somewhere on a path 
containing r^, to, n, and v (in this order), since every node has a single immediate dominator 
by E 3 U .Fa- 
Supposing s properly dominates to, then to = t, for the path from s to to occupies the place 
of an N\ node, namely q, on the path from r\ to \x. (By definition of an interpolation, r% = r 3 .) 
Conversely, q occupies the place of m in Ti; therefore q is an ancestor of [i. 
The same conclusion can be made if s equals to. 

If s is dominated by n, we have (i<Cs, from which it follows that /z <C q in Ti. Therefore, q 
is an immediate successor of /i. □ 



2.3 Interpolation rule 

An interpolation rule p is a tuple (A, T2,t), where A is a variable in V and t, the interpolation 
target, is a node of T2- Rule p is said to match a parse graph Fi if the interpolation function is 
defined for (Fi,r 2 ,g,t) for some q in 7 1 " 1 (A). 

The variable A is called the context pattern of the rule, T2 is called the addendum, and a 
matching Fi the context to which the rule is applied. 

2.3.1 Rule notation 

The clearest way of representing an interpolation involves a 2-dimensional representation of the 
addendum, as on Figure ^. On the other hand, since a parse graph is a representation of a parse 
tree, which is itself a representation of a context-free derivation, an addendum can be represented 
as a CF derivation. Thus, a rule (A,T2,t) can be represented as a left-hand side consisting of 
the context pattern A, a separating right arrow, and a right-hand side in which T2 is represented 
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as a leftmost derivation, and t is dotted. For example, the e-shift on Figure [j] can be noted as 
A — > (A => a). Likewise, the down-shift can be noted as V (V =^=> VA =j=> Va) and 

the right-shift as V -► (VS w5 uW). 

2.4 Graph Interpolation Grammar 

A Graph Interpolation Grammar is a tuple (V, E, P, S 1 ), where P is a set of interpolations rules, 
V and E are the label sets used by these rules, and S is a member of V called the axiom. 

At the start of a GIG derivation, the context is a subgraph consisting of one node labeled 
with the axiom. 

Each step matches a rule of P with the context and applies the corresponding interpolation, 
yielding a new context. 

3 Graph Interpolation Languages form a superclass of 
CFLs 

3.1 A definition of Graph Interpolation Languages 

Let an Instantaneous Description be defined as a tuple (T,w), where the string to parse is of 
the form vw, with v the yield of the context T. Initially, T consists of a single node labelled 
with the axiom, and v is empty. An interpolation I((T, T2, q, t)) — T3 maps (r, ax) to (Ts,x), 
with yield{T^) = yield(T).a. A string w is accepted by a GIG if applicable interpolations map 
the initial ID to (Tf,e), with yield(Tf) — w. The definition of a Graph Interpolation Language 
follows in the classical way. 

3.2 Graph Interpolation Languages as a superclass of Context-Free 
Languages 

Theorem 2 Any CFL not containing the string e can be recognized by a Graph Interpolation 
Grammar. 

Proof outline Any CFL not containing the string e can be described by a grammar in 
Greibach Normal Form, i.e. a CFG (V, S, P, S) in which every production is of the form A — > 07, 
with A in V, a in E, and 7 in V* . 

To build a GIG G' from a GNF grammar G, consider each production p as a GIG rule p 
whose context pattern consists of the left-hand side of p, and whose addendum represents the 
subtree obtained in a CFG-derivation by substituting the right-hand side for the left-hand side, 
except that the leftmost symbol of the right-hand side is replaced by the preterminal symbol 
X, X £ V U E, as shown in Figure |. 



A -> A 



B 



Figure 2: GIG rule built from the production A — > aB\...B, t 
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The GIG thus built is a tuple (V , E, P', S), where V equals V U {X} and P' is the set of 
GIG-rules just described. 

It is easy to prove by induction that the derivation produced by this GIG is a leftmost 
derivation of G. 

Indeed, initially, the context consists of a node labeled S, whose yield is therefore a leftmost 
sentential form of G. 

Supposing the yield of the current context is a leftmost sentential form xA±...A n , x in £*, 
then there will be an applicable GIG rule if and only if there is a production A\ — > aB\...B m 
in P, and this rule will map the context to a context whose yield is xaB±...B m A2...A n , i.e. a 
leftmost sentential form of G. □ 



4 GILs form a proper superclass of CFLs 

Theorem 3 The class of Context-Free Languages is properly included in the class of Graph 
Interpolation Languages. 

Proof outline Now it has been shown that every CFL not containing e is recognized by 
some GIG, it remains to show that there are languages which can be recognized by Graph 
Interpolation Grammars but not by Context-Free Grammars. 

The language a n b n c n d n , which is a classic example of a context-sensitive language, is recog- 
nized by the GIG ({S, A, B, C, D}, {a, b, c, d}, P, S), where P consists of the following rules: 



(1) 


S - 


- (S = 


>ASD = 


> aSD => aBSCD => aBeCD) 


(2) 


S - 


-> (S = 


> ASD = 


> aSD =4> aBSCD) 


(3) 


B - 


-> (B = 


>b) 




(4) 


C - 


- (R- 






(5) 


D - 


- (R = 


» d) 





A 2-dimensional representation of these rules appears on Figure 



s -> s s -> s 

I I 

(1) A *S (2) A *S 

II II 
a B ^ S <~ C a B 



I 



B -> B C -> C D -> D 

I II 
(3) b (4) c (5) d 



Figure 3: GIG rules for a n b n c n d n 

As production (1) requires the presence of an S'-node in the context frontier, it applies ex- 
clusively to the initial context. On the other hand, the frontier constraint requires that the 
down-shift (2) apply exclusively to the middle S of the previous interpolation, i.e. the S that 
is an immediate successor (by <c) of the last lexeme. Figure [| shows the first two steps of the 
unique derivation of aabbccdd. □ 
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Figure 4: Initial GIG derivation steps for aabbccdd 



5 GILs strongly subsume CFLs 

Define a 2-tiered Context-Free Grammar to be a CFG containing no production of the form 
A — > aa/3 in which a or /3 is a nonempty string in (V U E)* and a is a terminal symbol. The 
graph representations of trees generated by a 2-tiered CFG obey the constraint that lexical 



nodes have neither descendants nor siblings (Section 2.1.1). A simple way of making an arbitrary 
CFG G — (V, E,P, S) 2-tiered is to (i) add into V a preterminal symbol X a for each a in E, 
(ii) substitute X a for a in all productions of P, and (Hi) add into P the production X a —> a 
for each a in E. The resulting CFG G' remains close enough to the original grammar G for 
any semantic computation on a G tree to map straightforwardly to its counterpart G' tree, and 
vice versa. Thus, conversion to 2-tiered form is semantically neutral, and it seems therefore 
legitimate to say that GIGs are strongly equivalent to CFGs on the class of CFLs, even though 
the derivations obtained with a GIG are necessarily 2-tiered. In this sense, the following theorem 
asserts the strong equivalence of GIGs and CFGs on the class of CFLs. 

Theorem 4 Any unambiguous 2-tiered CFG not recognizing the empty string can be mapped 
to a GIG that constructs exactly the same CF derivations on the same inputs. 

Proof outline Consider the following procedure for building a GIG G' — (V, E, P', S) from 
a 2-tiered CFG G — (V, E, P, S): 

1. For each variable A in V, unwind all possible leftmost derivations by grammar G starting 
at A until either (i) the empty string is derived (A =!=>■ e), (ii) the derived string has a 



leftmost terminal symbol (A act), or (Hi) the leftmost symbol of the derived string is 

a variable of V that has already been replaced in the derivation (A ==> Ba B(3a). 
Let A be the set of derivations thus obtained. 

Note that, when a leftmost e is obtained, the derivation is pursued, if possible, until the 
leftmost symbol is a member of V U E. 

These bounded derivations will be used later on to form addenda in GIG rules. 
2. For each derivation D\ in A, of the form A ABa, for each derivation Di in A, of the 

form B b/3, b in E, replace D\ in A by the derivation A ==> ABa ==> Abfla. 

Here, use is made of the fact that G is unambiguous and 2-tiered to write the yield of 
any self-embedding ^-derivation as ABa. This step puts self-embedding derivations into 
a form suitable to create down-shifts. 
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3. For each pair of derivations (A 



e,B 



aA(3) in A 2 , with a/f, add the deriva- 



tion B =^> aA(3 ^> al3 to A. 

lm ' lm ' 

After this step, each derivation into e is inserted into a derivation whose yield starts with 
a symbol in £ U V. 

From each derivation whose yield has a leftmost terminal symbol, build an e-shift and add 
it to P'. 



The e-shift is built in the obvious way. A derivation D of the form A 



aa is mapped 



to an interpolation rule (A, T,r), where T is the parse graph representing D and r is the 
root of T. 

5. From each derivation D of the form A ==> Aba, build a down-shift (A, T, t), where T is 
the parse graph representing D and t is the leftmost node (labeled A) in the frontier of T. 

Example Consider the 

2-ticrcd CFG {{A 1 , A 2 , B 1 , B 2 , C u C 2 , D u D 2 , E, S}, {a, b, c, d}, P, S), with P consisting of the 
following productions: 

S -» A X B X | CiE 

A Y -» A 2 \e 

A 2 -> a 

B\ — ► B\B 2 | B 2 

B 2 -» b 

Ci -» C 2 D X 

C 2 -» c 

£>i -» DiD 2 I e 

^2 -» d 

£ -» e 



Step 1 The following derivations are inserted into A: 



D 2 

D 3 

Di 

D 5 

D 6 

D 7 

D s 

D 9 

D 10 

D n 

D 12 

D 13 

D 1A 

D 15 



S 
S 

s 
s 

A x 
A x 
A 2 

B! 

B x 
B 2 
Ci 
C 2 
Di 
D x 
D 2 
E 



AxBx = 
A x Bx -- 
A X B X = 
CiE = 

e 
a 

B\B 2 
6 

c 

e 
d 



C 2 DxE 



f aSj 
B\B 2 



cDi 



Note that derivations £> 2 and -D3 involve the e-production for A\. 
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Step 2 The left-recursive derivations D$ and D X3 are replaced by the following derivations. 

D 8 ' : B x BiBa =^ B 1 b 

Di 3 ': I>i =^ D1D2 =r=> Did 



Step 3 The empty derivations -Dg, D14 and Diq are exhaustively plugged into the yield of other 
derivations in A, yielding eventually the following set of additional derivations. 



Dl 7 


S 


lm 


lm 


D 18 


S 


lm 


lm 


D 19 


s 


lm 


lm 


D 20 


s 


lm'" 


lm 


D21 


Ci 


lm'' 


C2D1 = 


D22 




1™ 


DiD 2 = 



C 2 D X E 
C 2 DiE 
C 2 DiE 
C 2 DiE 
> cD x 
=► Did 



cD x E 
cD x E 
cD x E 
cD x E 

c 
d 



cE 

cD 1 

cE 



Note that D19 and D20 are created by plugging an e-production into a derivation created 
during the current step. On the other hand, the effect of A\ — > e is null during this step, 
but was taken into account during Step 1. 

Step 4 All derivations in A except (i) the self-embedding derivations D 2 , Dg and D X3 ' , and 
(ii) the empty derivations Dg, D14, and Di6 are mapped to e-shifts. Note that D19 and 
D20 map to the same addendum and therefore give rise to a single rule. 

Step 5 The self-embedding derivations D% and -D13 are mapped to down-shifts. 

Note that D 2 does not appear as such in the set of interpolation rules P', for it is redundant 
with the subset {Dg, D x , D3, Dg}. On the other hand, empty derivations are not retained 
either, for they are redundant with the subset {D 2 , D 3 , Dn, D xs , Dig, D 2 o, D 2 i,D 2 2}- 



Lemma 1 When the above procedure is applied to produce a GIG G' from a CFG G, all and 
only derivations of G can be constructed by applying rules of G' . 

Proof outline 

To prove that only derivations of G are built, consider that all addenda in the rules of G' 
represent derivations of G. Now, supposing inductively that the context to which a rule of G' 
applies represents a derivation of G, it can easily be shown that the resulting context will then 
also represent a derivation of G, whether the rule applied was a down-shift or an e-shift. 

To prove that all derivations of G can be built by applying rules of G' , we will first of 
all ignore the effect of left recursion and e-productions, and then factor them back in. We 

accordingly define =^ to denote a sequence of zero or more leftmost derivation steps that 

do not involve a left-recursive cycle or e-production, and will explicitly denote derivation steps 
that may involve an e-production as . 

The proof will hinge on leftmost derivations. However, a parse graph represents every possible 
derivation order; therefore, showing that all leftmost derivations of G can be represented by parse 
graphs of G' is sufficient to prove that all derivations of G can. 

1. For any leftmost derivation of G of the form S ==^> xAa xaf3a, if we suppose 

inductively that there is a GIG context that represents S =j- xAa, then, by construction 
(Steps 1 and 4), there is an e-shift of G' that maps this context to one that represents 
S xAa xafla. 

lm! lm! ' 
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2. In a 2-tiered CFG, every leftmost derivation involving a left-recursive cycle contains a sub- 
derivation of the form S =~> x\Aa =~> x\AB(3a x\xiBftu x\X2a r )fia, 
with x\ and X2 in E*, and a in S. 

Now, supposing inductively there is a GIG 

context corresponding to S x\Act x\x 2 a, then there is a down-shift of G' 

mapping it to every possible context representing a derivation of the form 

5 =~> xiAa xiABfla xix 2 B(3a xix 2 a7/3a 

. Now, since every elementary left-recursive cycle is captured by a down-shift (due to 
exhaustive derivation down to a leftmost cyclic occurrence or a terminal node), further 
application of down-shifts can generate all possible derivations not involving e-productions. 

3. To prove that all cases induced by the presence of e-productions are captured, it is sufficient 
to prove that, if S xAa is an e-free derivation of G and A — > e is a member of P, 

then G' can build the derivation S ==>• xAa xa. 

If A appears in the frontier of an addendum built in Steps 1 and 2, then Step 3, together 
with the fact that any e-free derivation of G can be built by G', guarantees that the effect 

of A — > e is captured. Otherwise, there necessarily exists a derivation B ==4> A/3, and 

therefore B Af3 (3 will necessarily be part of a derivation produced during 

Step 1. Eventually, this derivation will either be the topmost part of an addendum or, if 

it is a derivation into e, this derivation, B =~> e, behaves like an empty production and 

its effect is taken into account in the same way as A — > e. Now, since we have made the 
hypothesis that G does not recognize the empty string, there is no infinite regression; that 

is to say that A — > e necessarily applies within a derivation X ==> aB(3 =^> e such 

that X occurs in the frontier of an addendum built during Steps 1 and 2. 

Therefore, every possible incidence of an arbitrary empty production is captured by the 

procedure. 

□ 

If G' builds the same leftmost derivations as G, the language it recognizes is strongly equiv- 
alent to the language recognized by G. □ 



6 Deterministic GIGs and CFGs 

If, for any instantaneous description (Ti,w), where Ti is a GIG context whose yield is x and 
the string to parse is xw, no more than one rule is applicable, the GIG has a deterministic 
behavior. This behavior is interesting for its efficiency; for this reason, this section examines 
the generative power of deterministic GIGs. 

Theorem 5 The intersection of CFLs not containing e and deterministic GILs includes the 
class LL(1). 

Proof outline To start with, we will prove that an LL(1) grammar can be put into Greibach 
Normal Form without losing its LL(1) property. In order to prove this, since an LL(1) grammar 
contains no left recursion, it is enough to prove the following lemma. 
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Lemma 2 Let G = (V, S, P, S) be an LL(1) grammar. Let p = A — > a\Bct2 be a production 
in P and B — > (3\ \ P2 \ ■ ■ ■ \ fin be the set of all B -productions, B ^ A. Let G' = (V, E, P', S) 
be obtained from G by deleting production p from P and adding instead the productions 
A — > a,\(3\a,<2. I 0L\(3icti | . . . ai[3 n a2- Then G' recognizes the same language as G and is LL(1). 

Proof outline To adapt a recursive descent parser that parses according to G to parse 
according to G", inline all calls to the procedure B inside the procedure A. Since these calls are 
nonrecursive, the behaviour of the parser will be unaffected. □ 

Now, a GNF grammar is LL(1) if and only if there is no more than one A-production whose 
right-hand side begins with a for given A and a, or else more than one symbol of lookahead 
would be required to select an A-production. It is easy to see that, if this condition obtains, 
the construction procedure outlined in Section || produces a deterministic GIG equivalent to the 
source LL(1) grammar. Indeed, in any instantaneous description, the anchor for any rule of the 
type described in Section || is the immediate successor of the last lexeme that is present in the 
frontier of the context, so at most one node is eligible as the anchor; now, as at most one rule 
corresponds to the anchor symbol, the GIG that is constructed is deterministic. □ 

Beyond this admittedly rather drab result, it is somewhat arduous to characterize the class of 
deterministic GILs. It should be noted that this class includes some context-sensitive languages, 
such as the language analyzed in section [|, which is recognized by a deterministic GIG. 

On the other hand, one can conjecture that the intersection of CFLs and deterministic GILs 
properly includes LL(1) languages from observing the sequence of context yields obtained during 
a GIG derivation. 

Vertical e-shift A vertical e-shift substitutes a subgraph for a dangling node immediately 
following the last lexeme of Ti, so it maps a context whose yield is of the form xAa to 
a context whose yield is of the form xb(3a. The same mapping can be obtained by a 
leftmost derivation; and, as the decision to take it depends on one token of lookahead, a 
deterministic GIG using only vertical e-shifts can be emulated by an LL(1) parser. 

Horizontal e-shift A horizontal e-shift substitutes a subgraph for a node dominating the last 
lexeme, so the new subgraph starts with an -F-edge, as shown on Figure |. The yield 
mapping corresponding to a horizontal e-shift is of the form xa =4> xa(3a, and cannot 
therefore be emulated by a leftmost derivation. 



=> 



S 

I 
N 



III 



Figure 5: A derivation involving a horizontal e-shift 

Down-shift Either the yield of the anchor is made up of terminal symbols, as in a down-shift 
that encapsulates a left-recursion, or it contains exclusively nonterminal symbols, as in 
the down-shift that was used in Section [|. In the first case, the mapping is of the form 
xya => xyafla, where y is the yield of the anchor. In the second case, the mapping is 
of the form xaict2 =>■ xafi\OL\fiiot2, where ol\ is the yield of the anchor. Neither of these 
mappings can be emulated by a leftmost derivation. 

Right-shift Either the anchor is a dangling node, as in the example that was shown on Fig- 
ure [j], or it has descendants, as will be the case for the rule shown on figure M. These 
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cases correspond respectively to xAa =>• xa[3a and xa => xa[3a. Only the former can be 
emulated by a leftmost derivation. 



V -> V 



N 



Figure 6: A right-shift that applies to an ancestor of the last lexeme 



It seems possible to surmise that all yield mappings that cannot be accounted for by left 
derivations, when present in a deterministic GIG, are likely to allow it to recognize non LL(1) 
languages; but what precedes can in no way be considered as a proof of this, just as a motivation 
for investigating this hypothesis. 



7 Further research 

The discussion that precedes has concentrated on context-free grammars, languages, and deriva- 
tions. This can be seen as a preliminary step to determining the amount of context-sensitiveness 
supported by GIGs. In particular, the contribution of right-shifts and horizontal e-shifts to the 
formal power of GIGs remains to be determined. 

A conclusion of this paper is that the class of deterministic graph interpolation languages is 
possibly too restricted to be satisfactory; so another important issue for further research is the 
complexity of nondeterministic GIG parsing. 



8 Related works 

It is interesting to determine how GIG-based parsing relates to other approaches to word-by- 
word incremental parsing, such as Link Grammars [3T9f|, Dependency Grammars [ Mil92| , or 



Applicative Combinatory Grammars [Mil95] 



Its most distinctive feature with respect with these approaches is probably that GIG parsing 
has as its goal the building of a complete parse tree and each partial parse output is a represen- 
tation of a parse tree. On the contrary, parse trees do underlie the dependency graphs produced 
by a link grammar or a dependency grammar, but the parsing process does not rely on this fact. 
Whether a Link Grammar or Dependency Grammar can be easily expressed as a GIG, and with 
what processing consequences, is a subject for further research. 

On the other hand, incremental parsing based on Combinatory Grammars does not generate 
useful phrase structures, but incrementally builds semantic values, which constitute its real out- 
put. Consequently, meaningful comparison with GIG-parsing cannot be made until a semantic 
component is defined for GIGs. 
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